PF-0532 US 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Hillman, Jennifer L. 

Corley, Neil C. 
Baughn, Mariah R. 

(ii) TITLE OF INVENTION: DELTA l-PYRROLINE-5-CARBOXYLATE REDUCTASE 

HOMOLOG 

(iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

~ (E) COUNTRY: USA 

y (F) ZIP: 94304 

*Q (v) COMPUTER READABLE FORM: 

M= (A) MEDIUM TYPE: Diskette 

ry (B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

s? (D) SOFTWARE: FastSEQ for Windows Version 2.0b 

1= 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: TO BE ASSIGNED 
O (B) FILING DATE: HEREWITH 

(C) CLASSIFICATION: 

\S" (vii) PRIOR APPLICATION DATA: 

+; (A) APPLICATION NUMBER: 

?□ (B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Cerrone, Michael C 

(B) REGISTRATION NUMBER: 39,132 

(C) REFERENCE /DOCKET NUMBER: PF-0532 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 650-855-0572 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY : PROSNON01 
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(B) CLONE: 2278458 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Ser Val Gly Phe He Gly Ala Gly Gin Leu Ala Tyr Arg Phe Thr 

1 5 10 15 

Ala Ala Gly He Leu Ser Ala His Lys He He Ala Ser Ser Pro Glu 

20 25 30 

Met Asn Leu Pro Thr Val Ser Ala Leu Arg Lys Met Gly Val Asn Leu 

35 40 45 

Thr Arg Ser Asn Lys Glu Thr Val Lys His Ser Asp Val Leu Phe Leu 

50 55 60 

Ala Val Lys Pro His He He Pro Phe He Leu Asp Glu He Gly Ala 
65 70 75 80 

Asp Val Gin Ala Arg His He Val Val Ser Cys Ala Ala Gly Val Thr 

85 90 9S 

He Ser Ser Val Glu Lys Lys Leu Met Ala Phe Gin Pro Ala Pro Lys 

100 105 HO 

Val He Arg Cys Met Thr Asn Thr Pro Val Val Val Gin Glu Gly Ala 

115 120 125 

Thr Val Tyr Ala Thr Gly Thr His Ala Leu Val Glu Asp Gly Gin Leu 
« 130 135 140 

d Leu Glu Gin Leu Met Ser Ser Val Gly Phe Cys Thr Glu Val Glu Glu 

U 145 150 155 160 

Asp Leu He Asp Ala Val Thr Gly Leu Ser Gly Ser Gly Pro Ala Tyr 

165 170 175 

Ala Phe Met Ala Leu Asp Ala Asp Gly Gly Val Lys Met Gly Leu Pro 

180 185 190 

Arg Arg Leu Ala He Gin Leu Gly Ala Gin Ala Leu Leu Gly Ala Ala 

195 200 205 

Lys Met Leu Leu Asp Ser Glu Gin His Pro Cys Gin Leu Lys Asp Asn 

210 215 220 

Val Cys Ser Pro Gly Gly Ala Thr He His Ala Leu His Phe Leu Glu 
225 230 235 240 

Ser Gly Gly Phe Arg Ser Leu Leu He Asn Ala Val Glu Ala Ser Cys 

245 250 255 

He Arg Thr Arg Glu Leu Gin Ser Met Ala Asp Gin Glu Lys He Ser 

260 265 270 

Pro Ala Ala Leu Lys Lys Thr Leu Leu Asp Arg Val Lys Leu Glu Ser 

275 280 285 

Pro Thr Val Ser Thr Leu Thr Pro Ser Ser Pro Gly Lys Leu Leu Thr 

290 295 300 

Arg Ser Leu Ala Leu Gly Gly Lys Lys Asp 
305 310 

(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1742 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s i ng 1 e 

(D) TOPOLOGY : linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSNON01 

(B) CLONE: 2278458 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCGCCATCAG CCGCCCGGGA GATATCCGCC GGGGGAGAAT AGGGTTGCAC CATCCCAGAA 
GCTGCTGTTA GCTCGCCGGT CCTCGGCACG CCGCCCGTTC GCCCCTGCGC TGTCCGCCCT 
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TCCCCTAGCG TTACTTCCGG TCCCTCGCTG AGGGGGTTCG TGCGGCTCCC AGGAGGCGTG 180 

AACCGCGGAC CATGAGCGTG GGCTTCATCG GGGCCGGCCA GCTGGCCTAT GCTCTGGCGC 240 

GGGGCTTCAC GGCCGCAGGC ATCCTGTCGG CTCACAAGAT AATAGCCAGC TCCCCAGAAA 3 00 

TGAACCTGCC CACGGTGTCC GCGCTCAGGA AGATGGGTGT GAACCTGACA CGCAGCAACA 360 

AGGAGACGGT GAAGCACAGC GACGTCCTGT TTCTGGCTGT GAAGCCACAT ATCATCCCCT 420 

TCATCCTGGA TGAGATTGGG GCCGACGTGC AAGCCAGACA CATCGTGGTC TCCTGTGCGG 4 80 

CTGGTGTCAC CATCAGCTCT GTGGAGAAGA AGCTGATGGC ATTCCAGCCA GCCCCCAAAG 540 

TGATTC GCTG CATGACCAAC ACACCTGTGG TAGTGCAGGA AGGCGCTACA GTGTACGCCA 600 

CGGGCACCCA TGCCCTGGTG GAGGATGGGC AGCTCCTGGA GCAGCTCATG AGCAGCGTGG 660 

GCTTCTGCAC TGAGGTGGAA GAGGACCTCA TCGATGCCGT CACGGGGCTC AGTGGCAGCG 720 

GGCCTGCCTA TGCATTCATG GCTCTGGACG CATTGGCTGA TGGTGGGGTG AAGATGGGTT 7 80 
TGCCACGGCG CCTGGCAATC CAACTCGGGG CCCAGGCTTT GCTGGGAGCT GCCAAGATGC 840 
TGCTGGACTC GGAGCAGCAT CCATGCCAGC TTAAGGACAA TGTCTGCTCC CCTGGGGGAG 900 
CCACCATCCA CGCCCTGCAC TTTCTAGAGA GTGGGGGCTT CCGCTCTCTG CTCATCAATG 960 

CAGTTGAGGC CTCCTGTATC CGAACACGAG AGCTACAGTC CATGGCCGAC CAAGAAAAGA 1020 

TCTCCCCAGC TGCCCTTAAG AAGACCCTCT TAGACAGAGT GAAGCTGGAA TCCCCCACAG 1080 

TCTCCACACT GACCCCCTCC AGCCCAGGGA AGCTCCTCAC AAGAAGCCTG GCCCTGGGAG 1140 

GCAAGAAGGA CTAAGGCAGC ATCTGTCCCC TCTGTGATTC AGAGCCCTTA GTTGAGAGCC 1200 

CCTGCCGCCC CTGCCACCCC CCTGCCCCGC TCCCACCATT GCCCCTCCTC AGCTGTGCAA 1260 

GGAGAAAGCA TGCTTAGGAA GTTTTCAGGT CCTTGTGATA AAACCTCCTT AAATCTGTTC 1320 

AGACCAAGCA ATGCGAGCTT CCTCTCCTGT CCCATGTTGG AAGTTGCTCT GAAGGGGTGG 13 80 

TAGATGCTGG AAGCCAGACA CAACCCTGCG TACGCTGCTC AGTTGGTGGA GACTGGGGCT 1440 

GGGACTGGAG TCAGCCCAGC TGGGAGGAGG GGCTGGGGAG GATCTGCAGC TGAAGCCCGA 1500 

GGCAGGGTTG GTGTGATGCC AAGGCAAAGT GGTGAGGAGA AAACAGGAAA CGGGCTTTCT 1560 

CTGAATTGGT AAATGGGAAA GAAGTGAGCA ACTTAAGATT GTCACAATTA ATCACAAGTG 1620 

TACAGGATTA GAC TGGGTTT ATATTTAACT CTTGCTTCAT AGGTGTACCA TTTAAAGAGT 1680 

GTTATTTAAT GCTAAGTTTA ACTGCTTTAA TAAAGTTTAT TTTTAAATAT CAAAAAAAAA 1740 

AA 1742 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 
{ D ) TOPOLOGY : 1 inear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GENBANK 

(B) CLONE: 189498 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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Glu Glu Asp Leu lie Asp Ala Val Thr Gly III Ser Gly Ser Gly PrS 



Arg Leu Met Glu Gin Leu Leu Ser Thr Val Gly Phe Cys Thr Glu Val 
Glu Asp Leu 

Ala Tyr Ala Phe Thr Ala Leu Asp Ala Asp Gly Gly Val Lys Met Gly 

185 190 
Leu Pro Arg Arg Leu Ala Val Arg Leu Gly Ala Gin Ala Leu Leu Gly 

200 205 
Ala Ala Lys Met Leu Leu His Ser Glu Gin His Pro Gly Gin Leu Lys 

225 Ser ^ Gly Ala Thr m His Ala Leu His Val 

Leu Glu Ser Gly Gly Phe Arg Ser Leu Leu lie Asn Ala Val Glu m! 

^45 250 prr 

Ser Cys He Arg Thr Arg Glu Leu Gin Ser Met Ala Asp Gin Glu Gin 

265 om 
Val Ser Pro Ala Ala lie Lys Lys Thr lie Leu Asp Lys Vai Lys Leu 

280 285 
Asp Ser Pro Ala Gly Thr Ala Leu Ser Pro Ser Gly His Thr Lys Leu 



295 300 
Leu Pro Arg Ser Leu Ala Pro Ala Gly Lys Asp 

315 



305 310 
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